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(57) Abstract 

Sedoheptulose 1 ,7-bisphosphatase (SBPase) is an enzyme catalyzing the reaction converting sedoheptulose 1,7-bisphosphate into 
sedoheptulose 7-phosphate. This enzyme is located in the chloroplast in leaves and stems. Overexpression of SBPase in transgenic plants 
is provided to improve plant yield by increasing leaf starch biosynthetic ability in particular and sucrose production in general. Deregulated 
variants of the enzymes are also provided. 
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EXPRESSION OF SEDOHEPTULOSE L7 BISPHOSPHATASE 
IN TRANSGENIC PLANTS 

This application claims priority of the US provisional application, serial number 
60/133,964, filed on May 13, 1999. 

FIELD OF THE INVENTION 

This invention relates to the expression of sedoheptulose 1,7 bisphosphatase (SBPase) in 
transgenic plants to increase or improve plant growth and development, yield, vigor, and 
distribution of carbon assimilates. Transgenic plants expressing SBPase have improved carbon 
assimilation, export and storage in plant source and sink organs, which results in growth, yield 
and quality improvements in crop plants. 

BACKGROUND OF THE INVENTION 

Recent advances in genetic engineering have provided the prerequisite tools to transform 
plants to contain alien (often referred to as "heterogenous or heterologous") or improved 
endogenous genes. The introduction of such a gene in a plant would desirably lead to an 
improvement of an already existing pathway in plant tissues or introduction of a novel pathway 
to modify desired product levels, increase metabolic efficiency, and/or save on energy cost to the 
cell. It is presently possible to produce plants with unique physiological and biochemical traits 
and characteristics of high agronomic and crop processing importance. Traits that play an 
essential role in plant growth and development, crop yield potential and stability, and crop 
quality and composition are particularly desirable targets for crop plant improvement. These 
improvements may be achieved by genetically modifying a crop plant for improved carbon 
assimilation, more efficient carbon storage, and/or increased carbon export and partitioning 
capabilities. 

- - - Atmospheric carbon fixation- (photosynthesis) by plants, algae,' and photo synthetic 
bacteria represents the major source of energy to support processes in such organisms. The 
Calvin cycle, located in the stroma of the chloroplast, is the primary pathway of carbon 
assimilation in higher plants. Carbon assimilates can either leave the cycle for sucrose or starch 
biosynthesis or continue through the cycle to regenerate the carbon acceptor molecule, ribulose- 
1,5-bisphosphate. Sedoheptulose- l ? 7-bisphosphatase is an enzyme that catalyzes an essentially 
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irreversible reaction in the branch region where intermediates can leave the cycle, and therefore 
it may be essential to regulating carbon partitioning between the regeneration phase of the cycle 
and sucrose and starch biosynthesis. 

SBPase has no known cytosolic counterpart and is reported to be found only in the 
chloroplast, where it dephosphorylates sedoheptulose-l,7-bisphosphate (SBP) to form 
sedoheptulose-7-phosphate and inorganic phosphate. This enzyme is specific for SBP and is 
inhibited by its products as well as glycerate (Schimkat et al., 1990) and fructose-2,6- 
bisphosphate (Cadet and Meunier, 1988b). Light, a reducing agent, and Mg2+ are required for 
activity (Woodrow, 1982; Cadet and Meunier.. 1988a). The enzyme is a homodimer with a 
subunit molecular mass of 35-38 kDa (Nishizawa and Buchanan, 1981; Cadet and Meunier, 
1988c). 

It has been reported that removal of more than 80% of the enzymatic activity of SBPase 
in tobacco plants using antisense technology resulted in chlorosis, reduced growth rates, and 
reduced carbon assimilate levels (Harrison et al., 1998). Reduction in the quantum efficiency of 
photosystem II was also observed, which resulted in the reduction in carbohydrate content of the 
leaves. Analysis of carbohydrate status showed a shift from starch while sucrose levels were 
maintained. These results indicate that SBPase is a potential rate-limiting step in carbohydrate 
metabolism. 

Various sedoheptulose 1 ,7-bisphosphatases have been characterized biochemically, and 
the corresponding mRNAs (cDNA) have been cloned from an alga (Genbank accession number: 
X74418; Hahn and Kuck, 1994) and some higher plants such as Triticum aestivum (Genbank 
accession number: X65540; Miles et al., 1993), Spinacia oleracea (Genbank accession number: 
L76556; Martin et al., 1996) and Arabidopsis thaliana (Genbank accession number: S74719; 
Willingham, et al., 1994 ). Thus, over-expression of a nucleic acid sequence encoding SBPase in 
a transgenic plant will provide advantageous results in the plant such as improved carbon 
assimilation, export and storage; increased photosynthetic capacity; and extended photosynthetic 
ability. 
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SUMMARY OF THE INVENTION 

The present invention provides a method for improving the assimilation of carbon in 
plants using structural nucleic acid constructs that encode a sedoheptulose 1,7-bisphosphatase 
(SBPase) enzyme. 

In accomplishing the foregoing, there is provided, in accordance with one aspect of the 
present invention, a method for improving the assimilation of carbon in a plant comprising the 
steps of: 

(a) inserting into the genome of a plant a nucleic acid sequence comprising in the 5' to 3' 
direction and operably linked, 

(i) a promoter that functions in the cells of a selected plant tissue, 

(ii) a structural nucleic acid sequence that causes the production of a sedoheptulose 
1 ,7-bisphosphatase enzyme, 

(iii) a 3' non-translated nucleic acid sequence that functions in plant cells to cause 
transcriptional termination and the addition of polyadenylated nucleotides to the 
3' end of a RNA sequence; 

(b) obtaining transformed plant cells containing the nucleic acid of step (a); and 

(c) regenerating from transformed plant cells a transformed plant that overexpresses the 
sedoheptulose 1,7-bisphosphatase enzyme in the plant cells. 

In a further embodiment of the present invention an isolated nucleic acid sequence 
comprising a promoter capable of functioning in a plant cell, a structural nucleic acid sequence in 
sense orientation capable of causing the production of a sedoheptulose 1,7-bisphosphatase 
enzyme, and a 3' non-translated nucleic acid sequence capable of causing transcriptional 
termination and the addition of polyadenylated nucleotides to the 3' end of the transcribed 
mRNA sequence, is provided. This nucleic acid sequence may optionally include introns, 5' 
untranslated leader sequences or other nucleic acid sequences designed to enhance transcription 
and/or translation. 

In a still further embodiment of the invention, a novel, isolated nucleic acid sequence 
encoding a sedoheptulose 1 ,7-bisphosphatase enzyme from a green algae, Chlorella sorokxniana, 
is provided. 

In a yet further embodiment of the present invention, a variant nucleic acid sequence 
encoding a sedoheptulose 1.7-bisphosphatase enzyme is provided whereby the cysteine residues 
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in the polypeptide sequence are modified to another amino acid in a manner providing an active 
enzyme regardless of the presence of light. 

Therefore, in accordance with the present invention, a means for improving carbon 
assimilation, storage and export in the source tissues of a plant is provided. Further means of 
improved carbon accumulation in sinks (such as roots, tubers, seeds, stems, and bulbs) are 
provided, thus increasing the size of various sinks (larger roots and tubers) and subsequently 
increasing yield. The increased carbon availability to these sinks would improve and/or alter the 
composition of the cellular components of the plant (e.g., oils, proteins, starch and sucrose 
production and solids uniformity). One aim of the present invention is to overexpress 
sedoheptulose 1 ,7-bisphosphatases in plants by introducing a heterologous source of the 
sedoheptulose 1,7-bisphosphatase into the plant or by increasing the expression of the 
endogenous form of the gene in the plant. 

Various advantages may be achieved by the aims of the present invention. Increasing the 
expression of the sedoheptulose 1,7-bisphosphatase enzyme in the chloroplast would increase 
the flow of carbon through the Calvin Cycle and potentially increase atmospheric carbon 
assimilation in the presence of light . This would result in an increase in photosynthetic 
efficiency, an increase in chloroplast starch production (a leaf carbon storage form degraded 
during periods when photosynthesis is low or absent), and an increase in sucrose production by 
the leaf resulting in a net increase in carbon export to the sink and developing tissues would be 
expected during a given photoperiod. This increase in source capacity is a desirable trait in crop 
plants and would lead to increased plant growth, storage ability, yield and vigor. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a plasmid map for cloning vector pMON47205. 
Figure 2 shows a plasmid map for cloning vector pMON47207. 
Figure 3 shows a plasmid map for cloning vector pMON47208. 
Figure 4 shows a plasmid map for plant transformation vector pMON 10098. 
Figure 5 shows a plasmid map for plant transformation vector pMON47200. 
Figure 6 shows a plasmid map for shuttle vector pMON999. 
Figure 7 shows a plasmid map for transient transformation vector pMON47203. 
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Figure 8 shows an immunoblot of SBPase protein expression in corn protoplasts transformed 
with pMON47203. 

Figure 9 shows a bar graph comparing SBPase activity in corn protoplasts expressing wheat 
SBPase with control protoplasts. 

Figure 10 shows an alignment of Sedoheptulose-l,7-bisphosphatase proteins. 

DESCRIPTION OF SEQUENCES 
SEQ ID NO: 1 synthetic primer 
SEQ ID NO:2 synthetic primer 

SEQ ID NO:3 DNA sequence for mature SBPase (no CTP) 

SEQ ID NO:4 synthetic primer 

SEQIDNO:5 adaptor primer 

SEQIDNO:6 synthetic primer 

SEQIDNO:7 adaptor primer 

SEQ ID NO:8 DNA sequence of SBPase with CTP 

SEQ ID NO:9 amino acid sequence of SBPase 

SEQ ID NO: 10 a degenerate primer 

SEQ ID NO:l 1 a gene-specific primer 

SEQ ID NO: 12 a predicted amino acid sequence 

SEQ ID NO: 13 a gene-specific primer 

SEQ ID NO: 14 a nested gene-specific primer 

SEQ ID NO: 15 a gene-specific primer 

SEQ ID NO: 1 6 a gene-specific primer 

SEQ ID NO: 17 a nested gene-specific primer 

SEQ ID NO: 18 a gene-specific primer 

SEQ ID NO:19 a vector-specific primer . ... ^ ....... . 

SEQ ID NO:20 the fiill length Chlorella sorokiniana SBPase cDNA sequence 

SEQ ID NO:21 a mutagenic primer to change cysteines 110 and 1 15 to serines in Chlorella 

sorokiniana SBPase. 

SEQ ID NO:22 a variant cDNA sequence in which cysteines 1 10 and 1 15 of Chlorella 
sorokiniana SBPase are changed to serines. 
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SEQ ID NO: 23 a predicted variant protein sequence of Chlorella sorokiniana SBPase in which 
cysteines 1 10 and 1 15 are changed to serines. 

DETAILED DESCRIPTION OF THE INVENTION 

The following definitions are provided in order to aid those skilled in the art in 
understanding the detailed description of the present invention. 

The term "identity" refers to amino acid or nucleic acid sequences that when compared 
using the local homology algorithm of Smith and Waterman (1981)) in the BestFit program 
(Wisconsin Package Version 10.0, Genetics Computer Group (GCG), Madison, Wise.) are 
exactly alike. 

The term "similarity" refers to amino acid sequences that when compared using the local 
homology algorithm of Smith and Waterman (1981) in the BestFit program (Wisconsin Package 
Version 10.0, Genetics Computer Group (GCG), Madison, Wise.) match when conservative 
amino acid substitutions are considered. 

«C-terminal region" refers to the region of a peptide, polypeptide, or protein chain from 
the middle thereof to the end that carries the amino acid having a free carboxyl group. 
The phrase "nucleic acid or DNA segment heterologous to the promoter region" means that the 
coding DNA or nucleic acid segment does not exist in nature with the promoter to which it is 
now operably linked or coupled therewith. 

The term "encoding DNA" refers to chromosomal DNA, plasmid DNA, cDNA, or 
synthetic DNA that encodes any of the enzymes discussed herein. 

The term "genome" as it applies to bacteria encompasses both the chromosome and 
plasmids within a bacterial host cell. Encoding DNAs of the present invention introduced into 
bacterial host cells can therefore be either chromosomally integrated or plasmid-localized. The 
term "genome" as it applies to plant cells encompasses not only chromosomal DNA found 
within the nucleus, but organelle DNA found within subcellular components of the cell. DNAs 
of the present invention introduced into plant cells can therefore be either chromosomally 
integrated or organelle-localized. 

The terms "microbe" or "microorganism" refer to algae, bacteria, fungi, and protozoa. 
The term "mutein" refers to a mutant form of a peptide, polypeptide, or protein. 
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"N-terminal region" refers to the region of a peptide, polypeptide, or protein chain from 
the amino acid having a free amino group to the middle of the chain. 

"Overexpression" refers to the expression of a polypeptide or protein encoded by a DNA 
introduced into a host cell wherein said polypeptide or protein is either not normally present in 
the host cell, or wherein said polypeptide or protein is present in said host cell at a higher level 
than that normally expressed from the endogenous gene encoding said polypeptide or protein. 

The term "plastid" refers to the class of plant cell organelles that includes amyloplasts, 
chloropiasts, chromoplasts, elaioplasts, eoplasts, etioplasts, leucoplasts, and proplastids. These 
organelles are self-replicating, and contain what is commonly referred to as the "chloroplast 
genome," a circular DNA molecule that ranges in size from about 120 to about 217 kb, 
depending upon the plant species, and which usually contains an inverted repeat region. 

This invention is directed to a method for producing plant cells demonstrating an 
increased or improved plant growth and development, yield and vigor. The method utilizes a 
DNA sequence encoding an sbpase (sedoheptulose 1,7-bisphosphatase) gene integrated in the 
cellular genome of a plant as the result of genetic engineering and causes expression of the 
SBPase enzyme. Plants that overexpress the SBPase enzyme and that have improved carbon 
assimilation, export and storage in plant source and sink organs, which results in growth, yield, 
and quality improvements are also contemplated in this invention. 

The mechanism whereby the expression of exogenous SBPase modifies carbon 
relationships derives from source-sink relationships. The leaf tissue is a sucrose source, and if 
more sucrose, resulting from the activity of increased SBPase expression, is transported to a sink 
it results in increased storage carbon (sugars, starch, etc.) per given weight of the sink tissue. 

A method for producing genetically transformed plants that express increased levels of 
SBPase requires the introduction of a double-stranded recombinant DNA molecule into the 
nuclear genome of a plant cell. The DNA molecule must (1) contain a structural DNA for the 
SBPase enzyme being introduced into the plant cell; (2) possess a promoter that functions in 
plants to regulate the production of an RNA sequence in a constitutive or tissue-specific manner 
by RNA polymerase enzyme; and (3) have a 3' -untranslated region that functions to cause 
transcriptional termination and the addition of polyadenylated nucleotides to the 3' end of the 
RNA. The resulting primary RNA molecule is subsequently processed in the nucleus, a process 
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that involves the removal of intronic sequences and the addition of polyadenylate nucleotides to 
the 3' end of the mRNA. 
Sedoheptulose L7-bisphosphatases 

As used herein, the term "sedoheptulose 1,7-bisphosphatase" means an enzyme (E.C. 
3.1.3.37) that catalyzes the dephosphorylation of sedoheptulose 1,7-bisphosphate to 
sedoheptulose 7 -phosphate and inorganic phosphate. 

The SBPase gene used in the DNA constructs of the present invention can be any SBPase 
gene. Numerous SBPase cDNA sequences are known in the art including the sequences from 
wheat (Raines et al., 1992), spinach (Martin et aL, 1996), Arabidopsis (Willingham et al., 1994), 
and Ralstonia (Yoo and Bowien, 1995) The examples provided herein are illustrative of the use 
an SBPase, and should not be interpreted in any way to limit the scope of the present invention. 
Individuals skilled in the art will recognize that various other genes as well as alterations can be 
made to genes and methods described herein while not departing from the spirit and scope of the 
present invention. For example, one could utilize a SBPase that has been selected from 
alternative organisms or that has been modified to lack or alter enzymatic feedback inhibition by 
orthophosphate and glycerate. Because SBPase is activated by thioredoxin or DTT, one could 
also use mutagenesis to manipulate the enzyme to remain in the activated state without the 
reducer present. These mutated forms can then be utilized to modify plant metabolism as well. 
The overproduction of carbohydrate may also provide an increase in tolerance to stresses that 
affect the water potential of the plants by providing more carbon skeletons. Further means of 
improved carbon accumulation in sinks (such as roots, tubers, seeds, stems, and bulbs) are 
provided, thus increasing the size of various sinks (larger roots and tubers) and subsequently 
increasing yield. The increased carbon availability to these sinks would improve cellular 
component composition (e.g., oil, protein, starch and sucrose production and solids uniformity). 
Thus, many different nucleic acid sequences that encode a sedoheptulose 1,7-bisphosphatase 
activity may be isolated and used in the present invention. 

The SBPase cDNA from Chlorella sorokiniana is identified herein and may 
advantageously be used in connection with this invention. This gene may also be deregulated 
with respect to light by alteration of the cysteine residues to another amino acid, e.g., serine. 
Preferably, any cDNA sequence encoding SBPase that is about 85% identical to the Chlorella 
sorokiniana cDNA is within the scope of this invention, more preferably such cDNA sequence 
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that is about 90% identical to the Chlorella sorokiniana cDNA, and most preferably about 95% 
identical to the Chlorella sorokiniana cDNA. Moreover, any amino acid sequence encoding 
SBPase that is about 85%, and more preferably about 90% similar to the Chlorella sorokiniana 
predicted amino acid sequence disclosed herein is considered within the scope of this invention. 

Bisphosphatase genes from the facultative chernoautotroph Alcaligenes eutrophus (now 

named Ralstonia eutropha) and from the algae Synechococcus lepoliensis encode proteins with a 
dual activity, FBPase and SBPase, in the Calvin cycle (Yoo and Bowien, 1995; Gerbling et al., 
1986). Genes encoding proteins with SBPase activity can be cloned from these or other 
organisms by anyone skilled in the art using w^ell established methods. The source of the SBPase 
genes for use in this invention as described is not limited to the organisms described above and 
should not be interpreted in any way to limit the scope of the present invention. In one 
embodiment of the invention, the SBPase gene may be fused to a chloroplast transit peptide, in 
order to target the SBPase protein to the plastid. Those skilled in the art will also recognize that 
various other chimeric constructs can be made that utilize the functionality of a particular plastid 
transit peptide to import the sedoheptulose-1 ,7-bisphosphatase enzyme into the plant cell plastid 
depending on the promoter tissue specificity. 
Gene construction and modifications 

A sedoheptulose 1,7 bisphosphatase enzyme considered in this invention includes any 
sequence of amino acids, such as protein, polypeptide, or peptide fragment, that demonstrates the 
ability to catalyze the dephosphorylation of sedoheptulose 1 ,7-bisphosphate to sedoheptulose 7- 
phosphate and inorganic phosphate. As described above, these can be sequences obtained from a 
heterologous source, such as algae, bacteria, fungi, and protozoa, or endogenous plant sequences, 
by which is meant any sequence that can be naturally found, in a plant cell, including native 
(indigenous) plant sequences as well as sequences from plant viruses or plant pathogenic 
bacteria. 

. It will be recognized by one of ordinary skill in the art that SBPase enzyme gene 
sequences may also be modified using standard techniques such as site-specific mutation or 
PCR, or modification of the sequence may be accomplished by producing a synthetic nucleic 
acid sequence and will still be considered a SBPase enzyme nucleic acid sequence of this 
invention. For example, wobble positions in codons may be changed such that the nucleic acid 
sequence encodes the same amino acid sequence, or alternatively, codons can be altered such 
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that conservative or nonconservative amino acid substitutions result. In either case, the peptide or 
protein maintains the desired enzymatic activity and is thus considered part of this invention. 

In one embodiment of the invention, the SBPase nucleic acid sequence is modified to 
change cysteine residues in the amino acid sequence to a different amino acid to prevent 
formation of disulfide bridges in the mature polypeptide between the cysteine residues providing 
an active enzyme regardless of the presence of light and also therefore prevent inactivation of the 
native protein by oxidation. For example, in the wheat SBPase, the cysteine residues at 
positions 52 and 57 (numbers correspond to mature wheat SBPase as described in Raines et al. 
(1999) are modified to a different amino acid such as a serine, alanine or glycine to prevent 
formation of the disulfide bond there between. Correspondingly, the cysteine residues at amino 
acid positions 1 10 and 115 of the Chlorella SBPase (corresponding to the Chlorella SBPase 
numbering in SEQ ID NO: 12) would also be modified for the same effect. 

A nucleic acid sequence to a SBPase enzyme may be a DNA or RNA sequence, derived 
from genomic DNA, cDNA, mRNA, or may be synthesized in whole or in part. The structural 
gene sequence may be cloned, for example, by isolating genomic DNA from an appropriate 
source, and amplifying and cloning the sequence of interest using a polymerase chain reaction 
(PCR). Alternatively, the gene sequences may be synthesized, either completely or in part, 
especially where it is desirable to provide plant-preferred sequences. Thus, all or a portion of the 
desired structural gene may be synthesized using codons preferred by a selected plant host. 
Plant-preferred codons may be determined, for example, from the codons used most frequently 
in the proteins expressed in a particular plant host species. Other modifications of the gene 
sequences may result in mutants having slightly altered activity. 

If desired, the gene sequence of the sbpase gene can be changed without changing the 
protein/amino acid sequence in such a manner as may increase expression and thus even more 
positively affect carbohydrate content in transformed plants. A preferred manner for making the 
changes in the sbpase gene sequence are set out in PCT Publication WO 90/10076. A gene 
synthesized by following the methodology set out therein may be introduced into plants as 
described below and result in higher levels of expression of the SBPase enzyme. This may be 
particularly useful in monocots such as maize, rice, wheat, sugarcane and barley. 
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Promoters 

A number of promoters that are active in plant cells have been described in the literature. 
These include the nopaline synthase (NOS) and octopine synthase (OCS) promoters (which are 
carried on tumor-inducing plasmids of Agrobacterium tumefaciens), the caulimovirus promoters 
such as the cauliflower mosaic virus (CaMV) 19S and 35S and the figwort mosaic virus (FMV) 
35S-promoters, the light-inducible promoter from the small subunit of ribulose-l,5-bis- 
phosphate carboxylase (ssRUBISCO), a very abundant plant polypeptide, and the chlorophyll aft 
binding protein gene promoters, etc. All of these promoters have been used to create various 
types of DNA constructs that have been expressed in plants; see, e.g., PCT publication WO 
84/02913. 

Promoters that are known to or are found to cause transcription of DNA in plant cells can 
be used in the present invention. Such promoters may be obtained from a variety of sources such 
as plants and plant viruses and include, but are not limited to, the enhanced CaMV35S promoter 
and promoters isolated from plant genes such as ssRUBISCO genes. It is preferred that the 
particular promoter selected should be capable of causing sufficient expression to result in the 
production of an effective amount of sedoheptulose 1,7-bisphosphatase enzyme to cause the 
desired increase in carbon assimilation, export and storage. Expression of the double-stranded 
DNA molecules of the present invention can be driven by a constitutive promoter, expressing the 
DNA molecule in all or most of the tissues of the plant. In addition, it may also be preferred to 
bring about expression of the sbpase gene in specific tissues of the plant, such as leaf or stem, 
and the promoter chosen should have the desired tissue and developmental specificity. Those 
skilled in the art will recognize that the amount of sedoheptulose 1 ,7-bisphosphatase needed to 
induce the desired increase in carbon assimilation, export, or storage may vary with the type of 
plant. Therefore, promoter function should be optimized by selecting a promoter with the desired 
tissue expression capabilities and approximate promoter strength and selecting a transformant 
that produces the desired sedoheptulose .1,7-bisphosphatase activity or the desired change in 
metabolism of carbohydrates in the target tissues. This selection approach from the pool of 
transformants is routinely employed in expression of heterologous structural genes in plants 
because there is variation between transformants containing the same heterologous gene due to 
the site of gene insertion within the plant genome (commonly referred to as "position effect"). In 
addition to promoters that are known to cause transcription (constitutively or tissue-specific) of 
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DNA in plant cells, other promoters may be identified for use in the current invention by 
screening a plant cDNA library for genes that are selectively or preferably expressed in the target 
tissues and then determine the promoter regions. 

For the purpose of expressing the sbpase gene in source tissues of the plant, such as the 
leaf or stem, it is preferred that the promoters utilized in the double-stranded DNA molecules of 
the present invention have increased expression in these specific tissues. For this purpose, one 
may choose from a number of promoters for genes with leaf-specific or leaf-enhanced 
expression. Examples of such genes known from the literature are the chloroplast glutamine 
synthetase GS2 from pea (Edwards et aL 1990), the chloroplast fructose- 1 ,6-bisphosphatase 
(FBPase) from wheat (Lloyd et ah, 1991), the nuclear photosynthetic ST-LS1 from potato 
(Stockhaus et al., 1989), and the phenylalanine ammonia-lyase (PAL) and chalcone synthase 
(CHS) genes from Arabidopsis thaliana (Leyva et al., 1995). Also shown to be active in 
photosynthetically active tissues are the ribulose-l,5-bisphosphate carboxylase (RUBISCO) 
isolated from eastern larch (Larix laricind) (Campbell et ah, 1994); the cab gene, encoding the 
chlorophyll a/b-binding protein of PSII, isolated from pine (cab6; Yamamoto et aL 1994), wheat 
(Cab-1; Fejes et al., 1990), spinach (CAB-1; Luebberstedt et al., 1994), and rice (cablR: Luan et 
al., 1992); the pyruvate orthophosphate dikinase (PPDK) from maize (Matsuoka et aL 1993); the 
tobacco Lhcbl*2 gene (Cerdan et al., 1997); the Arabidopsis thaliana SUC2 sucrose-H+ 
symporter gene (Truernit et al., 1995); and the thylacoid membrane proteins, isolated from 
spinach (psaD, psaF, psaE, PC, FNR, atpC, atpD, cab, rbcS; Oelmueller et al., 1992). Other 
chlorophyll a/b-binding proteins have been studied and described in the literature, such as LhcB 
and PsbP from white mustard (Sinapis alba; Kretsch et al., 1995). Promoters that cause the 
production of SBPase specifically in the stems, leaves, or specific cell types in these tissues are 
useful in the present invention. For example, the RbcS bundle sheath-specific promoter is one 
such tissue-specific promoter. Thus native promoters for maize, wheat, barley, and rice may be 
obtained and used in the ^present invention as well as heterologous promoters from other 
organisms shown to function in a constitutive/tissue-specific manner. Carbon metabolism in C4 
plants such as corn is more specialized than in C3 plants such as tobacco. In C4 plants, 
metabolites generated by the Calvin cycle in the chloroplast of bundle sheath cells must be 
transported to the cytoplasm of the mesophyll cells where sucrose biosynthesis takes place. 
Therefore, cell-specific promoters are needed for correct gene expression in the proper cell type 
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in C4 crops. For example, the RbcS bundle sheath-specific promoter would be useful for 
expressing SBPase in the appropriate cell type in corn. 

For the purpose of expressing an sbpase gene (encoding a light-regulated protein or a 
protein modified to be constitutively active) only when the plant is photosynthetically active, it is 
preferred, that the promoters utilized in the . double-stranded DNA molecules of the present 
invention have expression in the presence of light only. For this purpose, one may choose from a 
number of promoters for light regulated genes, including FBPase from wheat (Miles, et al., 
1993), pyruvate orthophosphate dikinase from maize (Sheen, 1991), and chloroplast aldolase 
from rice (Kagaya et al., 1 995). 

The RNA produced by a DNA construct of the present invention may also contain a 
5' non-translated leader sequence. This sequence can be derived from the promoter selected to 
express the gene and can be specifically modified so as to increase translation of the mRNA. 
The 5' non-translated regions can also be obtained from viral RNAs, from suitable eukaryotic 
genes, or from a synthetic gene sequence. The present invention is not limited to constructs, as 
presented in the following examples, wherein the non-translated region is derived from the 
5' non-translated sequence that accompanies the promoter sequence. Rather, the non-translated 
leader sequence can be derived from an unrelated promoter or coding sequence. 

Generally, optimal expression in monocotyledonous and some dicotyledonous plants is 
obtained when an intron sequence is inserted between the promoter sequence and the structural 
gene sequence or, optionally, may be inserted in the structural coding sequence to provide an 
interrupted coding sequence. An example of such an intron sequence is the HSP 70 intron 
described in WO 93/19189. 
Polvadenvlation signal 

The 3' non-translated region of the chimeric plant gene contains a polyadenylation signal 
that functions in plants to cause the addition of polyadenylate nucleotides to the 3' end of the 
RNA. Examples of suitable 3' regions are (1) the 3' transcribed, non-translated regions 
containing the polyadenylation signal of Agrobacterium tumor-inducing (Ti) plasmid genes, such 
as the nopaline synthase (NOS) gene, and (2) plant genes like the soybean storage protein genes 
and the small subunit of the ribulose-l,5-bisphosphate carboxylase (ssRUBISCO) gene. 
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pjastiMrectgd gXErgssion ofsgdohsBtuJose 1 7-hisnhosphatase activity 

In one embodiment of the invention, the sbpase gene may be fused to a chloroplast 
transit peptide, in order to target the SBPase protein to the plastid. As used hereinafter, 
chloroplast and plastid are intended to include the various forms of plastids includmg 
axnyloplasts. Many plastid-localized proteins are expressed from nuclear genes as precursors and 
are targeted to the plastid by a chloroplast transit peptide (CTP), which is removed dunng the 
import steps. Examples of such chloroplast proteins include the small subunit of nbulose-1,5- 
biphosphate carboxylase (ssRUBISCO, SSU), 5-enolpyruvateshikimate-3-phosphate synthase 
(EPSPS) ferredoxin, ferredoxin oxidoreductase, the light-harvesting-complex protein I and 
protein II, and thioredoxin F. The plastid targeting sequence can be, but is not limited to, the 
native chloroplast targeting peptide (CTP) identified in the wheat SBPase cDNA. It has been 
demonstrated that non-plastid proteins may be targeted to the chloroplast by use of protein 
fusions with a CTP and that a CTP sequence is sufficient to target a protein to the plastid. Those 
skilled in the art will also recognize that various other chimeric constructs can be made that 
utilize the functionality of a particular plastid transit peptide to import the enzyme into the plant 
cell plastid depending on the promoter tissue specificity. 
Combinations with "ther transgenes 

The effect of sbpase in transgenic plants may be enhanced by combining it with other 
genes that positively affect carbohydrate assimilation or content, such as a gene encoding for a 
sucrose phosphorylase as described in PCT Publication WO 96/24679, or ADPGPP genes such 
as the £ coli glgC gene and its mutant g/ g Cl 6. PCT Publication WO 91/1 9806 discloses how to 
incorporate the latter gene into many plant species in order to increase starch or solids. Another 
gene that can be combined with sbpase to increase carbon assimilation, export or storage is a 
gene encoding for sucrose phosphate synthase (SPS). PCT Publication WO 92/16631 discloses 
one such gene and its use in transgenic plants. Another gene with which SBPase can be 
combined is fructose 1 ,6-bisphosphate aldolase. 
Plant transformation/regeneration 
" ~ In developing the nucleic acid constructs of this invention, the various components of the 
construct or fragments thereof will normally be inserted into a convenient cloning vector, e.g., a 
, plasmid that is capable of replication in a bacterial host, e.g., E. coli. Numerous vectors exist 
that have been described in the literature, many of which are commercially available. After each 
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cloning, the cloning vector with the desired insert may be isolated and subjected to further 
manipulation, such as restriction digestion, insertion of new fragments or nucleotides, ligation, 
deletion, mutation, resection, etc. so as to tailor the components of the desired sequence. Once 
the construct has been completed, it may then be transferred to an appropriate vector for further 
manipulation in accordance with the manner of transformation of the host cell. 

A double-stranded DNA molecule of the present invention containing an sbpase gene can 
be inserted into the genome of a plant by any suitable method. Preferred methods of 
transformation of plant cells or tissue are the Agrobacterium mediated transformation method 
and the biolistics or particle-gun mediated transformation method. Suitable plant transformation 
vectors for the purpose of Agrobacterium mediated transformation include those derived from a 
Ti plasmid of Agrobacterium tumefaciens, as well as those disclosed, e.g., by Herrera-Estrella et 
al. (1983), Bevan (1984), Klee et al. (1985) and EPO publication 120,516. In addition to plant 
transformation vectors derived from the Ti or root-inducing (Ri) plasmids of Agrobacterium, 
alternative methods can be used to insert the DNA constructs of this invention into plant cells. 
Such methods may involve, but are not limited to, for example, the use of liposomes, 
electroporation, chemicals that increase free DNA uptake, free DNA delivery via microprojectile 
bombardment, and transformation using viruses or pollen. 

A plasmid expression vector suitable for the introduction of a sbpase gene in monocots 
using electroporation or particle-gun mediated transformation is composed of the following: a 
promoter that is constitutive or tissue-specific; an intron that provides a splice site to facilitate 
expression of the gene, such as the Hsp70 intron (PCT Publication W093/19189); and a 3' 
polyadenylation sequence such as the nopaline synthase 3' sequence (NOS 3'; Fraley et al., 
1983). This expression cassette may be assembled on high copy replicons suitable for the 
production of large quantities of DNA. 

An example of a useful Ti plasmid cassette vector for plant transformation is pMON- 
17227. This vector is described in PCT Publication WO 92/04449 and contains a. gene encoding 
an enzyme conferring glyphosate resistance (denominated CP4), which is an excellent selection 
marker gene for many plants. The gene is fused to the Arabidopsis EPSPS chloroplast transit 
peptide (CTP2) and expressed from the FMV promoter as described therein. When adequate 
numbers of cells (or protoplasts) containing the sedoheptulose-l,7-bisphosphatase gene or cDNA 
are obtained, the cells (or protoplasts) are regenerated into whole plants. Choice of methodology 
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for the regeneration step is not critical, with suitable protocols being available for hosts from 
Leguminosae (alfalfa, soybean, clover, etc.), Umbelliferae (carrot, celery, parsnip), Cruciferae 
(cabbage, radish, canola/rapeseed, etc.), Cucurbitaceae (melons and cucumber), Gramineae 
(wheat, barley, rice, maize, etc.), Solanaceae (potato, tobacco, tomato, peppers), various floral 
crops, such as sunflower, and nut-bearing trees, such as almonds, cashews, walnuts, and pecans. 
See, for example, Ammirato et al. (1984); Shimamoto et al. (1989); Fromm (1990); Vasil et al. 
(1990); Vasil et al. (1992); Hayashimoto (1990); and Datta et al. (1990). 

Plants that can be made to have enhanced and/or improved carbon assimilation, increased 
carbon export and partitioning by practice of the present invention include, but are not limited to, 
Acacia, alfalfa, aneth, apple, apricot, artichoke, arugula, asparagus, avocado, banana, barley, 
beans, beet, blackberry, blueberry, broccoli, brussels sprouts, cabbage, canola, cantaloupe, carrot, 
cassava, cauliflower, celery, cherry, cilantro, citrus, Clementines, coffee, corn, cotton, cucumber,' 
Douglas fir, eggplant, endive, escarole, eucalyptus, fennel, figs, gourd, grape, grapefruit, honey 
dew, jicama, kiwifruit, lettuce, leeks, lemon, lime. Loblolly pine, mango, melon, mushroom, nut, 
oat, okra, onion, orange, an ornamental plant, papaya, parsley, pea, peach, peanut, pear, pepper, 
persimmon, pine, pineapple, plantain, plum, pomegranate, poplar, potato, pumpkin, quince, 
radiata pine, radicchio, radish, raspberry, rice, rye, sorghum, Southern pine, soybean, spinach, 
squash, strawberry, sugarbeet, sugarcane, sunflower, sweet potato, sweetgum, tangerine, tea, 
tobacco, tomato, turf, a vine, watermelon, wheat, yams, and zucchini. 

The following examples are provided to better elucidate the practice of the present 
invention and should not be interpreted in any was to limit the scope of the present invention. 
Those skilled in the art will recognize that various modifications, truncations, etc., can be made 
to the methods and genes described herein while not departing from the spirit and scope of the 
present invention. 

EXAMPLES 

EXAMPLE 1 

cDNA cloning and overexpression ofSBPase for antibody production 

To isolate the region of the gene encoding the mature SBPase protein (no CTP), an RT- 
PCR reaction was performed. One microgram of Triticum aestivum, CV OSLO leaf RNA was 
combined with 100 pmo l of random hexamer primers (BRL/Life Technologies Inc, 
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Gaithersburg, MD) or with 100 pmol of oligo dT primer (Promega, Madison, WI), heated for 5 
minutes at 75°C, and chilled on ice. First strand cDNA synthesis was performed using 
Superscript II™ reverse transcriptase (BRL/Life Technologies Inc, Gaithersburg, MD) according 
to the manufacturer's protocol. The terminated reverse transcription reaction was diluted 1:7. 
Three microliters of the diluted first strand synthesis products were combined with 100 p,M of 
each dNTP, 50 pmol of a gene specific primer with homology to the 5' end of the gene designed 
to generate an Ndel cleavage site for subcloning purposes (5'- 
ACATATGTGCGCGATCGGCGA-3\ SEQ ID NO:l), 50 pmol of a gene specific primer with 
homology to the 3' end of the gene (5 ' -GGATC C AG AAG AAG ATTATTAGGCG-3 ' , SEQ ID 
NO:2), and 5 Units of PWO™ polymerase (Boehringer, Mannheim, Germany) in 100 |aL. PCR 
cycling conditions were as follows: 95°C, 40 seconds; 56°C. 1 minute; 72°C, 1 minute 30 
seconds (5 cycles) followed by 95°C, 40 seconds; 61 °C, 1 minute; 72°C, 1 minute 30 seconds 
(30 cycles). The 982 bp SBPase mature protein gene PCR product was gel purified, cloned into 
the PCR-Blunt cloning vector (Invitrogen, Carlsbad, CA) to form pMON47205 (Figure 1), and 
transformed into competent E. coli cells. Clones bearing inserts were selected on media 
containing kanamycin, plasmid purified, and digested with BamHI to select for the proper 
orientation in the cloning vector. Sequence analysis revealed that predicted amino acid sequence 
(SEQ ID NO:9) is identical to the published sequence (Raines et aL, 1992) starting at residue 73, 
although the nucleotide sequence (SEQ ID NO:3) differs at 2 positions (residue 294 from T to C; 
residue 572 from C to T). The selected plasmids were restricted with Ndel and BamHI , 
directionally cloned under the control of the IPTG inducible T7 polymerase promoter of pET 15b 
bacterial expression vector (Novagen, Madison, WI) linearized with Ndel/BamHI, and 
transformed into DH5a. Transformants were screened by Ndel/BamHI restriction analysis and 
clones possessing an insert were selected, plasmid purified, and transformed into E. coli 
BL21(DE3) for protein expression purposes. 

E. coli BL21(DE3) cells transformed with the pET15b-SBPase cDNA construct were 
induced with 2 mM IPTG for 2.5 hours after which a distinct protein band of about 38 kDa was 
apparent on an SDS-PAGE gel, which correlates with the size of the subunit polypeptide chain of 
the dimeric SBPase. Protein expressed by the mature SBPase was purified based on the affinity 
of histidine residues for immobilized nickel ions. The purification was performed under 
denaturing conditions using Ni-NTA Superflow resin (QIAGER Valencia, CA) as described in 
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the manufacturer's protocol. A 2 mL fraction containing 1 mg of purified SBPase protein 
emulsified with an equal volume of Complete Adjuvant (Sigma, St. Louis, MO) was used to 
inoculate a goat for antibody production using standard methods (Antech Company, St. Louis, 
MO). The preimmune serum showed no reactivity with SBPase. 

EXAMPLE 2 

cDNA cloning of SBPase for expression in plants 

To clone the region of the SBPase gene encoding the chloroplast transit peptide, a 
modified anchored PCR procedure for the rapid amplification of cDNA ends (Frohman, 1990; 
Jain et al., 1992) was used. Eight hundred nanograms of total RNA was combined with 10 ng of 
a gene-specific primer (5'- TTCCTC AG AGC AC GCGTACTTG-3 ' , SEQ ID NO:4) ? heated to 
75°C for 5 minutes, and chilled on ice. First strand DNA synthesis was performed using 
Superscript II™ reverse transcriptase (BRL/Life Technologies Inc, Gaithersburg, MD) according 
to the supplier's protocol. The terminated reverse transcription reaction was treated with one 
unit of ribonuclease H for 20 minutes at 37°C, 5 minutes at 95°C, and chilled on ice. Excess 
primers and dNTPs were removed by centrifugation at 2,000 x g through an Ultrafree-MC 
filterfiige (30,000 MW cutoff, Millipore, Bedford, MA), and the retentate was concentrated to 15 
HL on a Savant Speedvac (Savant Instruments, Holbrook, NY). The first-strand synthesis 
products were combined with 10 uL of tailing mix (IX tailing buffer [BRL/Life Technologies 
Inc, Gaithersburg, MD], 0.4 mM dATP, 10 units of terminal deoxytransferase) and incubated at 
37°C for 10 minutes. The reaction mixture was heated to 95°C for 5 minutes, diluted to 0.5 mL 
with TE, pH 8.0, and utilized as a cDNA pool. A mixture of 10 uL of the cDNA pool, 10 uL 
PWO™ polymerase 10X buffer (BRL/Life Technologies Inc, Gaithersburg, MD), 100 uM of 
each dNTP, 25 pmol of a gene specific primer (SEQ ID NO:4), 10 pmol of the poly(dT) adaptor 
primer (5'-GGGTCGACATTCTAGACAGAATTCGTGGATCC(T) 21 -3'; SEQ ID NO:5), and 5 
units of PWO™ polymerase (BRL/Life Technologies Inc, Gaithersburg, MD) in 100 uL was 
amplified. " PCR cycling conditions were as follows: 95°C, 2 minutes; 45°C, 5 minutes; 72°C, 40 
minutes (1 cycle) followed by 95°C, 50 seconds; 48°C, 1 minute; 72°C, 1 minute (3 cycles). The 
PCR products were purified away from the excess primers by ethanol precipitation. The pellet 
was resuspended in 50 uL of water and subjected to another round of amplification using a new 
nested gene specific primer (5 ' -C ATGGG AGT ACTCCAACGCCTC-3 ' , SEQ ID NO:6) and an 
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adaptor primer (5 ' -GGGTCGAC ATTCTAG AC AGAA-3 ' , SEQ ID NO:7). PCR cycling 
conditions were as follows: 95°C, 40 seconds; 58 0 C ? 1 minute; 72°C, 30 seconds (30 cycles). 
The -500 bp PCR product was gel purified, subcloned into the PCR-Blunt cloning vector 
(Invitrogen, Carlsbad, CA) to form pMON47207 (Figure 2), and transformed into One Shot Top 
10 Cells (Invitrogen, Carlsbad, CA) for further characterization. Sequence analysis (SEQ ID 
NO:8) showed that this sequence differs from the published sequence in 9 positions (30, C to T; 
42, G to C; 44, A to G; 187, C to A; 204, C to G; 228, C to G; 259, C to T; 348, G to C; and 351, 
C to G) resulting in three predicted amino acid changes (30, Arg to Cys; 44, His to Arg; 207, Gin 
to Glu). 

In order to test the expression of the wheat SBPase in plants, pMON47205 (Figure 1) and 
pMON47207 (Figure 2) were each restricted with Sail and. BamHI. The fragments containing 
sequence encoding the mature SBPase and the CTP region, respectively, were gel purified, and 
ligated together to form pMON47208 (Figure 3). pMON47208 was restricted with Xbal and 
BamHI and the sequence encoding SBPase was gel purified and ligated to XbaliBamHI - 
linearized pMON 10098 (Figure 4). The final vector formed was pMON47200 (Figure 5) with 
the CaMV E35S promoter, the coding sequence of the entire SBPase gene, the NOS 3'- 
untranslated polyadenylation region, and kanamycin resistance for selection in plants. 
EXAMPLE 3 

Transient expression of maize protoplasts 

In order to test the expression of the wheat SBPase subunits and their assembly into 
active enzymes, vectors were constructed to contain the CaMV E35S promoter, the coding 
sequence for the entire SBPase protein including the CTP, the NOS 3' termination signal, and 
ampicillin resistance for selection in E. coll The SBPase gene was isolated as an Xbal-BamHI 
fragment from pMON47200. The SBPase gene was ligated into the Xbal-BamHI CaMV E35S, 
NOS 3' bearing region of pMON999 (Figure 6) to give pMON47203 (Figure 7). The DNA 
^constac^were^lecti^gorated into maize protoplasts according to the method of Sheen . et al. 
(1991). 

EXAMPLE 4 

Analysis of transformed maize protoplasts 

Pelleted protoplast samples transformed with pMON47203 (SBPase) and no DNA were 
thawed in 0.18 mL of extraction buffer (50 mM HEPES pH 7.5, 1 mM fructose bisphosphate, 1 
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mM sedoheptulose-l 5 7-bisphosphate, 10 mM MgCl 2 , 10 mM MnCl 29 10 mM DTT, 1% 
polyvinylpyrrolidone, 10% glycerol, and Complete™ Protease Inhibitors (Boehringer, 
Mannheim, Germany)) on ice. The cells in each suspension were vortexed well and clarified at 
2,000 x g for 15 minutes. The supernatants were desalted using G25 spin columns (The Nest 
Group, Southboro, MA). Total protein content of the desalted protein was determined using the 
BioRad microprotein assay (BioRad, Hercules. CA) according to the manufacturer's protocol. 

Protein expression and size were determined by Western blot analysis (Figure 8). 
Significantly more protein was detected by cross-reactivity with goat anti-wheat SBPase 
antibodies in protoplasts transformed with pMON47203 than control protoplasts, indicating 
successful over-expression of SBPase enzyme in plant cells. The mobility of the wheat SBPase 
(pMON47203) expressed in corn protoplasts was approximately 38 kDa, the same as the 
endogenous wheat leaf SBPase, indicating correct processing of the CTP. 

SBPase activity was assayed by allowing hydrolysis of SBP for 10 minutes followed by 
measurement of phosphate liberation. The reaction was initiated by combining 20 |iL of buffer 
(100 mM Tris, 8.2, 10 mM MgCI 2 , 10 mM DTT, 1.5 mM EDTA, 10% glycerol) containing 5 jxg 
of desalted protein extract with 55 of assay buffer (50 mM Tris, 8.2, 10 mM MgCl 2 , 10 mM 
DTT, 1.5 mM EDTA, 0.27 mM sedoheptulose-l,7-bisphosphate). The reaction was stopped 
after a 10 minute incubation at room temperature by adding 30 pL of 1 M perchloric acid. The 
precipitated protein was removed by centrifugation, 250 |uL of developer (1% ammonium 
molybdate, IN HC1, 0.05% Malachite green (Itaya and Michio, 1966)) was added to 50 fxL of 
supernatant, and the amount of phosphate liberated was determined by measuring absorbance at 
660 nm. A phosphate standard curve (0.5-10 nmol) was utilized for quantitation. Protoplasts 
transformed with pMON47203 liberated 3-fold more phosphate from sedoheptulose-1,7- 
bisphosphate than control protoplasts (Figure 9). 

The high level of activity observed for the protoplasts transformed with wheat SBPase 
gene provides evidence that the SBPase protein detected by Western blot analysis is functional. 

EXAMPLE 5 

Wheat SBPase expression in tobacco 

In order to test the effects of over-expressing SBPase from wheat in tobacco, 
pMON47200 (Fig. 5) was prepared and used as the transformation vector for tobacco plant 
transformation. Tobacco plant cells were transformed by Agrobacterium mediated 
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transformation according to Horsch et al. (1985). The plant transformation vectors were 
mobilized into the ABI Agrobacterium strain by electroporation. 

Growth chamber-grown tobacco transformant lines were generated and first screened by 
Western blot analysis to identify expressors using goat antibody raised against wheat-expressed 
SBPase. It. .was not, however, possible to separate the endogenous tobacco SBPase from the 
transgenic wheat SBPase using gradient SDS-PAGE gels or isoelectric focusing gels for these 
tobacco transformant lines. 

EXAMPLE 6 

cDNA cloning of SBPase from Chlorella sorokiniana for expression in plants 
To determine the sequence of the gene encoding SBPase protein from Chlorella 
sorokiniana, several RT-PCR reactions were performed. One microgram of Chlorella 
sorokiniana RNA was combined with 100 pmol of oligo dT primer (Promega, Madison, WI), 
heated for 5 minutes at 75°C, and chilled on ice. First strand cDNA synthesis was perfonned 
using Superscript II™ reverse transcriptase (BRL/Life Technologies Inc, Gaithersburg, MD) 
according to the manufacturer's protocol. The terminated reverse transcription reaction was 
diluted 1:7. Twenty microliters of the diluted first strand synthesis products were combined with 
100 |aM of each dNTP, 50 pmol of a degenerate primer with homology to the 5' end of the gene 
(5'- GGIACIATHTTYGGIGTITGG -3', SEQ ID NO: 10), 50 pmol of a gene specific primer 
with homology to the 3' end of the gene (5'- RTAICKIARIGTRTAYTTYTC -3', SEQ ID 
NO: 11), and 5 Units of Taq polymerase (BRL/Life Technologies Inc, Gaithersburg, MD) in 100 
(iL. PCR cycling conditions were as follows: 95°C, 5 minutes (1 cycle) followed by 95 °C, 40 
seconds; 35°C, 1 minute; 72°C, 1 minute (5 cycles) followed by 95°C, 40 seconds; 40°C, 1 
minute; 72°C, 1 minute (30 cycles). The 550 bp PCR product that was 250 bp larger than the 
expected size was gel purified, cloned into the PCR-Blunt cloning vector (Invitrogen, Carlsbad, 
CA), and transformed into competent E. coli cells. Plasmid was purified from clones containing 
insert that had been selected on" media containing kanamycin/ Sequence" analysis revealedlthat 
the predicted amino acid sequence of this 550 bp PCR product is 41% identical to the 
Chlamydomonas reinhardtii sequence. Residues 226-251 of the Chlamydomonas reinhardtii 
sequence was 81% identical to the Chlorella sequence and residues 252-287 of the 
Chlamydomonas sequence was 86% identical but there was a gap encoding 62 Chlorella, 
residues presumably corresponding to an intron. Therefore this fragment was most likely 
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amplified from contaminating genomic DNA. The 297 bp of coding sequence was used to 
design primers for 5' RACE and 3' RACE to identify the sequence of the rest of the gene. 

To clone the remaining 5' sequence of the SBPase, a modified anchored PCR procedure 
for the rapid amplification of cDNA ends (Frohman, 1990; Jain et al., 1992) was used. Nine 
hundred nanograms of total Chlorella sorokiniana RNA was combined with 20 pmol of a gene- 
specific primer (5 '-GATGGTCTCGGTCTCCTTCACG-3 ' , SEQ ID NO: 13), heated to 75°C for 
5 minutes, and chilled on ice. First strand DNA synthesis was performed using Thermoscript™ 
reverse transcriptase (BRL/Life Technologies Inc, Gaithersburg, MD) at 55°C according to the 
supplier's protocol. The terminated reverse transcription reaction was treated with one unit of 
ribonuclease H for 20 minutes at 37°C, 5 minutes at 95°C, and chilled on ice. Excess primers 
and dNTPs were removed by centrifugation at 2,000 x g through an Ultrafree-MC filterfuge 
(30,000 MW cutoff, Millipore, Bedford, MA), and the retentate was concentrated to 15 uL on a 
Savant Speedvac (Savant Instruments, Holbrook, NY). The first-strand synthesis products were 
combined with 10 uL of tailing mix (IX tailing buffer [BRL/Life Technologies Inc, 
Gaithersburg, MD], 0.4 mM dATP, 10 units of terminal deoxytransferase) and incubated at 37°C 
for 10 minutes. The reaction mixture was heated to 95°C for 5 minutes, diluted to 0.2 mL with 
10 mM Tris, pH 8.5, and utilized as a cDNA pool. A mixture of 20 uL of the cDNA pool, 10 uL 
PWO™ polymerase 10X buffer (BRL/Life Technologies Inc, Gaithersburg, MD), 100 uM of 
each dNTP, 25 pmol of a gene specific primer (SEQ ID NO: 13), 10 pmol of the poly(dT) adaptor 
primer (5 ' -GGGTCGAC ATTCTAGAC AGAATTCGTGGATCC(T)2 1 -3 ' ; SEQ ID NO:5), and 5 
units of PWO™ polymerase (BRL/Life Technologies Inc, Gaithersburg, MD) in 100 uL was 
amplified. PCR cycling conditions were as follows: 95°C, 2 minutes; 45°C, 5 minutes; 72°C, 40 
minutes (1 cycle) followed by 95°C, 50 seconds: 48°C, 1 minute; 72°C, 1 minute (3 cycles). The 
PCR products were purified away from the excess primers by ethanol precipitation. The pellet 
wasresuspended in 50 uL of water and combined with 10 ul of PWO™ polymerase 10X buffer 
(BRL/Life Technologies Inc, Gaithersburg, MD) in lOOuJ, 50 \xM of each dNTP, 50 pmol a new 
nested gene specific primer (5'- CAGCCACTTGCCATCGTC -3', SEQ ID NO:14), 50 pmol of 
an adaptor primer (5 ' -GGGTCGACATTCTAG AC AGAA-3 ' , SEQ ID NO:7), 5 Units of 
PWO™ polymerase fBRL/Life Technologies Inc. Gaithersburg. MD ). PCR cycling conditions 
were as follows: 95°C, 40 seconds; 48°C. 1 minute; 72°C, 1 minute, 30 seconds (30 cycles). The 
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400 bp PCR product was gel purified, subcloned into the PCR-Blunt TOPO II cloning vector 
(Invitrogen, Carlsbad, CA) and transformed into competent E. coli cells for further 
characterization. Sequence analysis showed that this PCR fragment was part of the 5' sequence 
of the Chlorella sorokiniana SBPase. 
5 This, sequence was then used to design additional primers to clone the remaining 5' 

sequence of the SBPase using the modified anchored PCR procedure for the rapid amplification 
of cDNA ends (Frohman, 1990; Jain et al. ? 1992). Nine hundred nanograms of total RNA was 
combined with 20 pmol of a gene-specific primer (5 '-GATGGTCTCGGTCTCCTTC ACG-3 ' , 
SEQ ID NO: 15), heated to 75 °C for 5 minutes, and chilled on ice. First strand DNA synthesis 
10 was performed using Thermoscript™ reverse transcriptase (BRL/Life Technologies Inc, 
Gaithersburg, MD) at 55°C according to the supplier's protocol. The terminated reverse 
transcription reaction was treated with one unit of ribonuclease H for 20 minutes at 37°C, 5 
minutes at 95°C, and chilled on ice. Excess primers and dNTPs were removed by centrifugation 
at 2,000 x g through an Ultrafree-MC filterfuge (30,000 MW cutoff, Millipore, Bedford, MA), 

15 and the retentate was concentrated to 15 fiL on a Savant Speedvac (Savant Instruments, 
Holbrook, NY). The first-strand synthesis products were combined with 10 ^iL of tailing mix 
(IX tailing buffer [BRL/Life Technologies Inc, Gaithersburg, MD], 0.4 mM dATP, 10 units of 
terminal deoxytransferase) ancl incubated at 37°C for 10 minutes. The reaction mixture was 
heated to 95°C for 5 minutes, diluted to 0.2 mL with TE, pH 8.0, and utilized as a cDNA pool. A 

20 mixture of 20 ^L of the cDNA pool, 10 uL HotStarTaq™ polymerase 10X buffer fOiagen. 
Valencia. CA), 20 ill of O 5X buffer TOiagen. Valencia, CA) 50 nM of each dNTP, 25 pmol of a 
gene specific primer (5'-TCCTCAGAGCACGCCAGCTTGC-3' , SEQ ID NO:16), 10 pmol of 
the poly(dT) adaptor primer (5 5 -GGGTCGACATTCTAGACAGAATTCGTGGATCC(T)2]-3 > ; 
SEQ ID NO:5), and 5 units of HotStarTaq™ polymerase (Oiagen. Valencia, CA ) in 100 \xL was 

25 amplified. PCR cycling conditions were as follows: 95°C, 15 minutes; 45°C, 5 minutes; 72°C, 
40 minutes (1 cycle) followed by 95°C, 50 seconds; 48°C, 1 minute; 72°C, 1 minute (3 cycles). 
The PCR products were purified away from the excess primers by ethanol precipitation. The 
pellet was resuspended in 50 ^L of water and combined with 10 HotStarTaq™ polymerase 
IPX buffer fOiagen. Valencia. CA), 20 yj of O 5X buffer fOiagen, Valencia, CA), 50 uM of 

30 each dNTP 50 pmol of new nested gene specific primer (5'- 
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AGCTGCTC ATCGCCGAACGAGTTG-3 \ SEQ ID NO: 17), 50 pmol of an adaptor primer (5'- 
GGGTCGACATTCTAGAC AGAA-3 ? , SEQ ID NO:7) ? and 5 Units of HotStarTaq™ 
polymerase fOiaeen, Valencia, CA ). PCR cycling conditions were as follows: 95°C 15 minutes 
H cycled followed by 95°C, 40 seconds; 50°C 1 minute; 72°C, 1 minute, 30 seconds (30 cycles). 
The 380 bp PCR product was gel purified, subcloned into the PCR-Blunt TOPO II cloning 
vector (Invitrogen, Carlsbad, CA) and transformed into competent E. coli cells for further 
characterization. Sequence analysis showed that this PCR fragment contains the remaining 5' 
sequence of the Chlorella SBPase. 

PCR was used to isolate the remaining 3' sequence of the SBPase from a Chlorella 
sorokiniana cDNA library in pSportl constructed using Superscript™ Plasmid System for 
cDNA Synthesis and Plasmid Cloning (BRL/Life Technologies Inc, Gaithersburg, MD) 
according to the supplier's protocols. One microgram of Chlorella sorokiniana RNA was 
combined with 100 pmol of oligo dT primer (Promega, Madison, WI), heated for 5 minutes at 
75°C 5 and chilled on ice. First strand cDNA synthesis was performed using Thermoscript™ 
reverse transcriptase (BRL/Life Technologies Inc, Gaithersburg, MD) at 60°C according to the 
manufacturer's protocol instead of the Superscript™ reverse transcriptase (BRL/Life 
Technologies Inc, Gaithersburg, MD) provided with the kit. A mixture of 200 ng of library 
DNA, 10 fiL HotStarTaq™ polymerase 10X buffer (Oiagen. Valencia, CA ). 50 jiM of each 
dNTP, 50 pmol of a gene specific primer (5 ? - AGTTCCTGCTGCAGGACGATGG -3'; SEQ ID 
NO:18), 50 pmol of the vector specific primer (5'-CCCAGTCACGACGTTGTAAAACG -3'; 
SEQ ID NO: 19), and 5 units of HotStarTaq™ polymerase ( Oiagen. Valencia. CA) in 100 ^L 
was amplified. PCR cycling conditions were as follows: 95°C 5 15 minutes (1 cycle) followed 
by 95°C, 40 seconds; 62°C, 1 minute; 72°C, 1 minute, 30 seconds (30 cycles). The overlapping 
5' and 3' sequences were assembled to give the full length Chlorella sorokiniana SBPase 
sequence (SEQ ID NO:20) with predicted amino acid sequence (SEQ ID NO: 12). 



Table 1 . Comparison of Chlorella sorokiniana SBPase derived amino acid sequence (SEQ ID 
NO: 12) with known SBPase amino acid sequences. The alignments were made using the Bestfit 
program to determine percentage homology in the table below. BestFit program uses the local 
homology algorithm of Smith and Waterman (1981) to find the best segment of similarity 
between two sequences. 
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Similarity 
76.7% 
82.0% 
74.3% 
72.0% 



Identity 



Arabidopsis thaliana 
Chlamydomonas reinhardtii 
Spinacia oleracea 
Triticum aestivum 



71.6% 
78.0% 
68.4% 
66.8% 



EXAMPLE 7 

Cloning genes encoding dual FBPase and SBPase activity 

_ Bisphosphatase genes from the facultative chemoautotroph Ralstonia eutropha 

(Alcaligenes eutrophus) and from the algae Synechococcus lepoliensis encode proteins with a 

dual activity, FBPase and SBPase, in the Calvin cycle (Yoo and Bowien, 1995; Gerbling et al., 

1986). A gene encoding a protein with SBPase activity will be cloned from Ralstonia eutropha 

using PCR based on the sequence described in Yoo and Bowien(1995). This gene will then be 

fused to a chloroplast transit peptide, in order to target the SBPase protein to the plastid. 

EXAMPLE 8 

Chlorella and Ralstonia SBPase expression in tobacco 

In order to test the effects of over-expressing wild type or deregulated Chlorella as well 
as Ralstonia SBPase in tobacco, each gene will first be cloned into the pCR®-Blunt II™ cloning 
vector (Invitrogen, Carlsbad, CA). Each vector will then be digested with EcoRI and the gel 
purified fragments encoding each of the SBPases will be ligated individually to EcoRI- 
linerarized pMON 10098 (Figure 4). Each of the 3 vectors will be used for tobacco plant 
transformation, as described in Horsch et al. (1985). The plant transformation vectors are 
mobilized into the ABI Agrobacterium strain by electroporation. 

Growth chamber-grown tobacco transformant lines are generated and first screened by 
Western blot analysis to identify expressors using goat antibody raised against wheat-expressed 
SBPase. Subsequently, for wild type Chlorella-SBPasc, deregulated Chlorella SBPase- and 
Ralstonia SBPase-expressing tobacco lines, leaf samples taken at various stages of plant 
-development will be analyzed for diurnal changes in leaf nonstructural carbohydrates- (sucrose, - 
glucose, and hydrolyzed starch into glucose) by means of the Sucrose/D-Glucose/D-Fructose kit 
(Boehringer Mannheim, G ermany ) . 

Thirty to fifty milligrams of frozen tobacco leaf tissue samples will be incubated in 1 mL 
of 85°C water for 15 minutes. Tubes will be centrifuged for 1 minute at 10,000 x g and the 
supematants saved for soluble sugar analysis. The pellet will be resuspended in 1 mL of 85°C 



WO 00/70062 



PCT/US00/13238 



-26- 



water, mixed with a Vortex, and centrifuged as described above. The supernatant will be 
carefully removed and added to the previous supernatant fraction for soluble sugar (sucrose and 
glucose) analysis by the Sucrose/D-Glucose/D-Fructose kit (Boehringer Mannheim, Germany). 

The starch will be extracted from the pellet using by adding 3 mL of 0.1 M sodium 
acetate, pH 5.6 to the pellet and incubating at 90°C for 10 minutes. Once cool, 3 mL of 1% 
amyloglucosidase in 0.1 M sodium acetate, pH 5.6 will be added and vortexed. The samples will 
be incubated in a water bath at 50°C for 3 hours, vortexing every half hour. After centrifugation 
in a table top centrifuge at 900 x g for 30 minutes, the supematants will be analyzed for glucose 
content. The free glucose will be adjusted to anhydrous glucose (as it occurs in starch by 
multiplying by the ratio 162/182). 

Alteration of carbon assimilation in the transgene plant tissue can be monitored by 
metabolic profiling using standard HPLC, mass spectroscopy, and enzymatic based metabolite 
assays. 

EXAMPLE 9 
Deregulation ofSBPase 

In the dark, wheat SBPase is inactive due to disulfide bond formation between Cys-52 
and Cys-57 (numbers correspond to mature wheat SBPase, CA Raines et al. 1999. J. Exp. Bot. 
50: 1-8). In the light, thioredoxin reduces the disulfide linkage resulting in an active enzyme. 
Modifying the Cys residues through mutagenesis would prevent formation of the disulfide bond 
and therefore prevent inactivation of the protein by oxidation. Site-directed mutagenesis to 
change Cys-1 10 and Cys-1 15 (numbers correspond to Chlorella SBPase including the CTP, SEQ 
ID NO:12) to Ser will be performed using the QuikChange™ Site-Directed Mutagenesis Kit 
(Stratagene, La Jolla, CA) using a mutagenic primer (5'- 
AAGGTGCGCACCGCCTCGTCCGGCGGCACCGCCTCCGTCAACTCGTTCGGCGATG - 

3'; SEQ ID NO: 21) according to the manufacturer's protocol. The resulting sequence SEQ ID 
NO:22-will be. inserted- into. the. appropriate- transformation vector and will be used to transform 
tobacco and corn. The resulting plants will be tested for expression of the SBPase in the plant 
tissues and enzymatic assays will be run to confirm activity. The plants will be analyzed to 
determine the improved assimilation of carbon in the plant and its export and storage to sink 
tissues. The predicted amino acid sequence resulted from the DNA sequence (SEQ ID NO:22) is 
shownasSEQIDNO:23. 
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It should also be understood that the examples and embodiments described herein are for 
illustrative purposes only and that various modifications or changes in light thereof will be 
suggested to persons skilled in the art and are to be included within the spirit and purview of this 
application and the scope of the appended claims. 

All publications and patent applications mentioned in this specification are indicative of 
the level of skill of those skilled in the art to which this invention pertains. All publications and 
patent applications are herein incorporated by reference to the same extent as if each individual 
publication or patent application was specifically and individually indicated to be incorporated 
by reference. 
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CLAIMS: 

1. A method for improving the assimilation of carbon in a plant comprising the steps of: 

a) . inserting into the genome of a plant a nucleic acid sequence comprising in the 5' 

to 3' direction, 

i) . a promoter that functions in the cells of said plant, said promoter operably 

linked to; 

ii) . a structural nucleic acid sequence that causes the production of a 

sedoheptulose 1,7-bisphosphatase enzyme, said structural nucleic acid 
sequence operably linked to; 

iii) . a 3' non-translated nucleic acid sequence that functions in said cells of 

said plant to cause transcriptional termination; 

b) . obtaining transformed plant cells containing the nucleic acid sequence of step (a); 

and 

c) . regenerating from said transformed plant cells a transformed plant that 

overexpresses said sedoheptulose 1 ,7-bisphosphatase enzyme in said plant 
cells. 

2. The method of claim 1 wherein said structural nucleic acid sequence that causes the 
production of a sedoheptulose 1 ,7-bisphosphatase enzyme is from a source heterologous with 
respect to the plant into which the structural nucleic acid sequence is inserted. 

3. The method of claim 2 wherein said structural nucleic acid sequence that causes the 
production of a sedoheptulose 1,7-bisphosphatase enzyme is from a bacterial microorganism. 

4. The method of claim 3 wherein said structural nucleic acid sequence that causes the 
production of a sedoheptulose 1,7-bisphosphatase enzyme is from Ralstonia. 

5. The method of claim 2 wherein said structural nucleic acid sequence that causes the 
-production of a sedoheptulose 1,7-bisphosphatase enzyme is from a green algae. 

6. The method of claim 5 wherein said structural nucleic acid sequence that causes the. 
production of a sedoheptulose 1,7-bisphosphatase enzyme is from Chlorella. 

7. The method of claim 2 wherein said structural nucleic acid sequence that causes the 
production of a sedoheptulose 1,7-bisphosphatase enzyme is from a plant species heterologous 
with respect to the plant into which the structural nucleic acid sequence is inserted. 
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8. The method of claim 1 wherein said structural nucleic acid sequence that causes the 
production of a sedoheptulose 1,7-bisphosphatase enzyme is endogenous to said plant into which 
it is inserted and said promoter operably linked thereto is capable of expressing said inserted, 
endogenous sedoheptulose 1,7-bisphosphatase enzyme at a level greater than that of a plant not 
transformed to contain said inserted structural nucleic acid sequence. 

9. The method of claim 1 wherein said plant is a monocotyledonous plant. 

10. The method of claim 1 wherein said plant is a dicotyledonous plant. 

1 1 . The method of claim 1 wherein said plant is selected from the group consisting of maize, 
wheat, rice, potato alfalfa, barley, cotton, soybean, canola, sunflower, and sugarbeet. 

12. The method of claim 1 wherein said sedoheptulose 1,7-bisphosphatase enzyme is 
deregulated with respect to light. 

13. An isolated nucleic acid sequence comprising: 

(a) a promoter that functions in the cells of a plant, said promoter operably linked to; 

(b) a structural nucleic acid sequence in sense orientation that causes the production 

of a sedoheptulose 1 ,7-bisphosphatase enzyme, said structural nucleic acid 
sequence operably linked to, and 

(c) a 3' non-translated nucleic acid sequence that functions in said cells of said plant 

to cause transcriptional termination. 

14. The isolated nucleic acid sequence of claim 13 further comprising an intron. 

15. The isolated nucleic acid sequence of claim 13 further comprising a 5 5 untranslated 
leader sequence. 

16. The isolated nucleic acid sequence of claim 13 wherein said structural nucleic acid 
sequence that causes the production of a sedoheptulose 1,7-bisphosphatase enzyme "is from a 
bacterial microorganism. 

17. The isolated nucleic acid sequence of claim 16 wherein said structural nucleic acid 
sequence that^causes the production of a sedoheptulose 1,7-bisphosphatase enzyme is from 
Ralstonia. 

18. The isolated nucleic acid sequence of claim 13 wherein said structural nucleic acid 
sequence that causes the production of a sedoheptulose 1,7-bisphosphatase enzyme is from a 
green algae. 
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19. The isolated nucleic acid sequence of claim 18 wherein said structural nucleic acid 
sequence that causes the production of a sedoheptulose 1 ,7-bisphosphatase enzyme is from 
Chlorella. 

20. The isolated nucleic acid sequence of claim 19 wherein said structural nucleic acid 
sequence that causes the production of a sedoheptulose 1 ,7-bisphosphatase enzyme is from a 
plant. 

21. The isolated nucleic acid sequence of claim 13 wherein said sedoheptulose 1 ,7- 
bisphosphatase enzyme is deregulated with respect to light. 

22. An isolated nucleic acid sequence comprising the nucleotide sequence of SEQ ID NO:20, 
or a degenerate variant of SEQ ID NO:20. 

23 . An isolated nucleic acid sequence comprising a sequence that encodes a polypeptide with 
the amino acid sequence of SEQ ID NO: 12 or SEQ ID NO: 12 with conservative amino acid 
substitutions. 

24. An isolated nucleic acid sequence comprising a sequence at least 85% identical to SEQ 
IDNO:20. 

25. An isolated nucleic acid sequence comprising a sequence that encodes a polypeptide the 
amino acid sequence of which is at least 85% similar to SEQ ID NO:12. 
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SEQUENCE LISTING 



<110> Miller, Philip w 
Staub, Robin h 



<120> Expression of Sedoheptulose 1,7 Bisphosphatase i 
Transgenic Plants 

<130> mobs014p 

<140> n/a 

<141> 2000-05-12 

<150> 60/133,964 
<151> 1999-05-13 



<160> 23 



<170> Patentln Ver. 2.1 

<210> 1 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 1 

acatatgtgc gcgatcggcg a 



<210> 2 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 2 

ggatccagaa gaagattatt aggcg 



<210> 3 
<211> 966 
<212> DNA 

<213> Triticum aestivum 



<400> 3 

atgtgcgcga tcggcgacag cctggaggag 
ctcatcaggc tgctgatctg catgggggag 
acggcctcct gcggcggcac ggcctgcgtc 
gacatgctcg ccgacaagct cctcttcgag 
gcgtgctctg aggaagtccc cgagctgcag 



ttcctgacca aggcgacgcc ggacaagaac 60 
gcgatgagga cgatcgcctt caaggtccgg 12 0 
aactccttcg gcgacgagca gctcgccgtc 180 
gcgttggagt actcccatgt gtgcaagtac 240 
gacatgggtg gcccggtcga aggtggattc 3 00 
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agtgtggcgt tcgaccccct 
accatcttcg gcgtctggcc 
gctgccgcca tgggcatcta 
cccgggacac acgaattcct 
acgagcatcg gagaagggaa 
cctgattatg acaagcttgt 
ggaggaatgg tccctgatgt 
gtgacgtcgc cgacggcgaa 
ttcttgatag agaaggccgg 
gtgatctccg tcctggacga 
cgcttcgagg agaccctcta 
accgcc 



tgacggctcc agcatcgtgg 
cggcgacaag ctgaccggcg 
cggccctcgc accaccttcg 
tctcctcgac gaaggtaaat 
gatgttctcc cctggcaatc 
caactactat gtgaaggaga 
caaccagatc atcgtgaagg 
ggcgaagctg cggctgctgt 
cgggcacagc agcgacggca 
gcggacccag gtggcctacg 
cggctcctcc agactcgccg 



acaccaactt caccgtggga 36 0 
tcaccggcgg tgaccaggtt 420 
tagttgccct caaggactgc 480 
ggcagcatgt caaggacacc 540 
tgagggccac gttcgacaac 600 
agtacactct gcgttacacc 660 
agaagggcat cttcacgaac 72 0 
tcgaggtggc gccgctgggg 780 
agcagtcggt gctggacaag 84 0 
gctccaagaa cgagatcatc 900 
ccagcgccac cgtcggcgcc 960 

966 



<210> 4 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 4 

ttcctcagag cacgcgtact tg 22 

<210> 5 
<211> 53 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial- Sequence : Adaptor Primer 
<400> 5 

gggtcgacat tctagacaga attcgtggat cctttttttt tttttttttt ttt 53 



<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of ..Artificial Sequence : Synthetic - • 
Primer 

<400> 6 

catgggagta ctccaacgcc tc 22 



<210> 7 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Adaptor Primer 
<400> 7 

gggtcgacat tctagacaga a 21 



<210> 8 

<211> 1208 

<212> DNA 

<213> Triticum aestivum 



<400> 8 

tctagagcag caatggagac cgtcgcggct 
tccccggcgt gctgcgccgc catgtccttc 
ccggcgacct cgttctacgg cgagtcgctg 
gcggggaggc agtccaaggc ggcgagccgg 
gacagcctgg aggagttctt gaccaaggcg 
atctgcatgg gggaggcgat gaggacgatc 
ggcacggcct gcgtcaactc cttcggcgac 
aagctcctct tcgaggcgtt ggagtactcc 
gtccccgagc tgcaggacat gggtggcccg 
ccccttgacg gctccagcat cgtggacacc 
tggcccggcg acaagctgac cggcgtcacc 
atctacggcc ctcgcaccac cttcgtagtt 
ttccttctcc tcgacgaagg taaatggcag 
gggaagatgt tctcccttgg caatctgagg 
cttgtcaact actatgtgaa ggagaagtac 
gatgtcaacc agatcatcgt gaaggagaag 
gcgaaggcga agctgcggct gctgttcgag 
gccggcgggc acagcagcga cggcaagcag 
gacgagcgga cccaggtggc ctacggctcc 
ctctacggct cctccagact cgccgccagc 
ttcttctg 



gccggctacg cccgcggggc cgccacgcgc 60 
tcgcagtcct acaggcccaa ggctgccagg 12 0 
cgggcgaaca cggcgaggac gtcgttcccg 180 
gcggcgctca ccacccggtg cgcgatcggc 240 
acgccggaca agaacctcat caggctgctg 300 
gccttcaagg tccggaccgc gtcctgcggc 36 0 
gagcagctcg ccgtcgacat gctcgccgac 420 
catgtgtgca agtacgcgtg ctctgaggaa 480 
gtcgaaggcg gattcagtgt ggcgttcgac 54 0 
aacttcaccg tgggaaccat cttcggcgtc 600 
ggcggtgacc aggttgctgc cgccatgggc 66 0 
gccctcaagg actgccccgg gacacacgaa 720 
catgtcaagg acaccacgag catcggagaa 780 
gccacgttcg acaaccctga ttatgacaag 840 
actctgcgtt acaccggagg aatggtccct 900 
ggcatcttca cgaacgtgac gtcgccgacg 960 
gtggcgccgc tggggttctt gatagagaag 1020 
tcggtgctgg acaaggtgat ctccgtcctg 1080 
aagaacgaga tcatccgctt cgaggagacc 1140 
gccaccgtcg gcgccaccgc ctaataatct 1200 

1208 



<210> 9 
<211> 322 
<212> PRT 

<213> Triticum aestivum 
<400> 9 

Met Cys Ala lie Gly Asp Ser Leu Glu Glu Phe Leu Thr Lys Ala Thr 
1 5 10 15 

Pro Asp Lys Asn Leu lie Arg Leu Leu lie Cys Met Gly Glu Ala Met 
20 25 30 

Arg Thr lie Ala Phe Lys Val Arg Thr Ala Ser Cys Gly Gly Thr Ala 
35 40 45 

Cys Val Asn Ser Phe Gly Asp Glu Gin Leu Ala Val Asp Met Leu Ala 
50 55 60 

Asp Lys Leu Leu Phe Glu Ala Leu Glu Tyr Ser His Val Cys Lys Tyr 
65 70 75 " 80 
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Ala Cys Ser Glu Glu Val Pro Glu Leu Gin Asp Met Gly Gly Pro Val 
85 90 95 

Glu Gly Gly Phe Ser Val Ala Phe Asp Pro Leu Asp Gly Ser Ser lie 
100 105 * ~ 110 

Val Asp Thr Asn Phe Thr Val Gly Thr lie Phe Gly Val Trp Pro Gly 
115 120 125 

Asp Lys Leu Thr Gly Val Thr Gly Gly Asp Gin Val Ala Ala Ala Met 
130 135 140 

Gly lie Tyr Gly Pro Arg Thr Thr Phe Val Val Ala Leu Lys Asp Cys 
145 150 155 160 

Pro Gly Thr His Glu Phe Leu Leu Leu Asp Glu Gly Lys Trp Gin His 
165 170 175 

Val Lys Asp Thr Thr Ser lie Gly Glu Gly Lys Met Phe Ser Pro Gly 
180 185 190 

Asn Leu Arg Ala Thr Phe Asp Asn Pro Asp Tyr Asp .Lys Leu Val Asn 
195 200 205 

Tyr Tyr Val Lys Glu Lys Tyr Thr Leu Arg Tyr Thr Gly Gly Met Val 
210 215 220 

Pro Asp Val Asn Gin He He Val Lys Glu Lys Gly He Phe Thr Asn 
225 230 235 240 

Val Thr Ser Pro Thr Ala Lys Ala Lys Leu Arg Leu Leu Phe Glu Val 
245 250 ~ 255 

Ala Pro Leu Gly Phe Leu He Glu Lys Ala Gly Gly His Ser Ser Asp 
260 265 270 

Gly Lys Gin Ser Val Leu Asp Lys Val He Ser Val Leu Asp Glu Arg 
275 280 285 

Thr Gin Val Ala Tyr Gly Ser Lys Asn Glu He He Arg Phe Glu Glu 
290 295 300 

Thr Leu Tyr Gly Ser Ser Arg- Leu Ala Ala Ser Ala Thr Val Gly Ala 
305 310 315 320 

. Thr Ala . - ^ - - - 



<210> 10 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Degenerate 
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Primer 
<400> 10 

ggacathtty gggttgg 

<210> 11 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
Primer 

<400> 11 

rtackargtr tayttytc 



17 



Sequence : Synthetic 



18 



<210> 12 
<211> 382 
<212> PRT 

<213> Chlorella sorokiniana 
<400> 12 

Met Gin Ala Thr Ala Val Ala Thr Ala Ala Pro Ala Ala Arg Val Ala 
1 5 10 15 

Thr Thr Gly Lys Ala Ala Thr Gly Val Lys Ala Ala Pro Arg Val Ala 
20 25 30 

Val Arg Ala Ala Gly Ala Ser Ala Ser Ser Ser Phe Ala Thr Gly Ala 
35 40 45 

Arg Leu Ser Lys Ala Ser Arg Thr Ala Ala Arg Arg Ala Ala Val Ala 
50 55 60 

Ala Gin Ala Lys He Gly Asp Thr Leu Glu Glu Phe Leu Leu Glu Ala 
65 70 75 80 

Thr Pro Asp Pro Lys Leu Arg Gin Leu Met Met Ser Met Ser Glu Ala 
85 90 95 

He Arg Thr He Ala Tyr Lys Val Arg Thr Ala Ser Cys Gly Gly Thr 
100 105 110 

Ala . Cys Val Asn Ser Phe Gly Asp Glu Gin Leu Ala Val Asp Leu Leu 
115 120 125 

Ala Asp Lys Leu Leu Phe Glu Ala Leu Lys Tyr Ser Gly Cys Cys Lys 
130 135 .140 

Leu Ala Cys Ser Glu Glu Val Pro Glu Pro Leu Asp Leu Gly Gly Glu 
145 150 ice 

155 i5o 

Gly Phe Ser Val Ala Phe Asp Pro Leu Asp Gly Ser Ser He Val Asp 
!65 170 175 
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Thr Asn Phe Ser Val Gly Thr He Phe Gly Val Trp Pro Gly Asp Lys 
180 185 190 

Leu Thr Gly He Thr Gly Arg Gin Gin Ala Ala Ala Gly Met Gly He 
195 200 205 

Tyr Gly Pro Arg Thr Val Phe Cys He Ala Leu Lys Asp Ala Pro Gly 
210 215 220 

Cys His Glu Phe Leu Leu Gin Asp Asp Gly Lys Trp Leu His Val Lys 
225 230 235 240 

Glu Thr Glu Thr He Gly Glu Gly Lys Met Phe Ser Pro Gly Asn Leu 
245 250 255 

Arg Ala Thr Phe Asp Asn Pro Ala Tyr Glu Lys Leu lie Ala Tyr Tyr 
260 265 270 

He Gly Glu Lys Tyr Thr Leu Arg Tyr Thr Gly Gly Met Val Pro Asp 
275 280 285 

Val Phe Gin He He Val Lys Glu Lys Gly Val Phe Thr Asn Val He 
290 295 300 

Ser Pro Ser Thr Lys Ala Lys Leu Arg Leu Leu Phe Glu Val Ala Pro 
305 310 315 320 

Leu Ala Leu Leu Val Glu Lys Ala Gly Gly Ala Ser Ser Cys Asp Gly 
325 330 335 

Leu Cys Val Ser Gly Leu Asp Val Glu Val Lys Gin His Asp Gin Arg 
340 345 350 

Thr Gin He Cys Tyr Gly Ser Lys Gly Glu Val Arg Arg Phe Glu Glu 
355 360 365 

Tyr Met Tyr Gly Asn Ser Pro Arg Phe Ser Glu Val Thr Ala 
370 * 375 380 



<210> 13 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



Description of Artificial Sequence: Synthetic 
Primer 



<400> 13 

gatggtctcg gtctccttca eg 



22 



<210> 14 
<211> 18 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 14 

cagccacttg ccatcgtc 



<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 15 

gatggtctcg gtctccttca eg 



<210> 16 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 16 

tcctcagagc acgccagctt gc 22 



<210> 17 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 17 

agetgetcat cgccgaacga gttg 24 



<210> 18 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 
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<400> 18 

agttcctgct gcaggacgat gg 22 



<210> 19 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Primer 

<400> 19 

cccagtcacg acgttgtaaa acg 23 



<210> 20 
<211> 1460 
<212> DNA 

<213> Chlorella sorokiniana 
<400> 20 

gccggttgat cctcgagctc caaggcactc gggcacgatg caggccaccg ctgtcgccac 60 
cgccgcccct gcggcccgcg tcgccaccac tggcaaggcc gccaccggcg tcaaggccgc 120 
cccccgcgtg gccgtgcgcg ccgccggcgc cagcgccagc agcagctttg ccaccggcgc 180 
ccgcctgagc gccaaggcca gccgcaccgc cgcccgccgc gccgccgtgg ccgcccaggc 240 
caagatcggc gacacgctgg aggagttcct gctggaggcc acccccgacc ccaagctgcg 3 00 
ccagctcatg atgtccatgt ccgaggccat ccgcaccatc gcctacaagg tgcgcaccgc 360 
ctcgtgcggc ggcaccgcct gcgtcaactc gttcggcgat gagcagctgg ccgtcgacct 420 
gctggccgac aagctgctgt tcgaggccct caagtactct ggctgttgca agctggcgtg 480 
ctctgaggag gtgcctgagc ccctggacct gggcggcgag ggcttctccg tggcatttga 54 0 
ccccctggac ggctcctcca tcgtggacac caacttctct gtgggcacga tatttggggt 6 00 
gtggcccggc gacaagctga ccggcatcac gggccgccag caggccgccg ccggcatggg 660 
catctacggc ccccgcaccg tcttctgcat cgccctcaag gacgcccccg gctgccacga 720 
gttcctgctg caggacgatg gcaagtggct gcacgtgaag gagaccgaga ccatcggcga 780 
gggcaagatg ttctcccccg gcaacctgcg cgccaccttt gacaaccccg cgtacgagaa 840 
gctgatcgcc tactacatcg gcgagaagta cacgctgcgc tacaccggcg gcatggtgcc 900 
cgacgtgttc cagatcatcg tgaaggagaa gggcgtgttc accaacgtca tctccccctc 960 
caccaaggcc aagctgcgcc tgctgttcga ggtggcgccc ctggccctgc tggttgagaa 1020 
ggcaggcggc gcctcctcct gcgacggcct gtgcgtgagc ggcctggacg tggaggtcaa 1080 
gcagcacgac cagcgcaccc agatctgcta tggctccaag ggcgaggtgc ggcggtttga 1140 
ggagtacatg tacggcaact ccccccgctt ctccgaggtc accgcctaag cggctggtca 1200 
tcgcctgagc ggctcagtgc tgctgactat gcagccggcg gctgactatg ctggtcttaa 1260 
cctgagcggc tggccgtcaa acgctggcta gcagcgccgc cccctgagca gcctcggaga 1320 
ctcccgccgg ctggcctatt caagctggct ggcgccggag ctccgcctgc cctggttgca 1380 
cccaccattc gcttgctccc ccctccgcgc tttcatcatg tgctttccgc ccgcaacgcc l'440 
ctgtccaatt cattcattat 1460 



<210> 21 
<211> 55 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Synthetic 
Primer 

aaggJgcgca ccgcctcgtc cggcggcacc gcctccgtca actcgttcgg cgatg 55 

<210> 22 
<211> 1460 
<212> DNA 

<213> Chlorella sorokiniana 

iss ssss sss ass: s=s ass: s. 

55 ESS =5=5 5£5 E5I 5|| 

ass sssa 5= ass:: ssss sssg 
SSS iSS S3S isiS iii SS s 

SEE ESSS SSSS SS= SBK SKf 

?ga?gtg?tc cagatcatcg tgaaggagaa gggcgtgttc accaacgtca "tccccctc 960 

~ S2S SSSS 535SS SSSS ; S: 
S SSSS SSSS 25SS S3SSS 5 - 
SSSS 5S3K SSSS S33S5 S3S33 S 5SS 

ctcfcgccgg cfggcctatt caagctggct ggcgccggag ctccgcctgc cctggt gca 1380 
cccaccattc gcttgctccc ccctccgcgc tttcatcatg tgctttccgc ccgc g 
ctgtccaatt cattcattat 



<210> 23 
<211> 381 
<212> PRT 

<213> Chlorella sorokiniana 
<400> 23 



Met Gin Ala Thr Ala Val Ala Thr Ala Ala Pro Ala Ala Arg Val. Ala . 
X 5 1° 15 

Thr Thr Gly Lys Ala Ala Thr Gly Val Lys Ala Ala Pro Arg Val Ala 



20 



25 



Val Arg Ala Ala Gly Ala Ser Ala Ser Ser Ser Phe Ala Thr Gly Ala 
- - 40 45 



35 



Arg Leu Ser Ala Lys Ala Ser Arg Thr Ala Ala Arg Arg Ala Ala Val 
50 55 6° 
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Ala Ala Gin Ala Lys lie Gly Asp Thr Leu Glu Glu Phe Leu Leu Glu 
65 70 75 80 

Ala Thr Pro Asp Pro Lys Leu Arg Gin Leu Met Met Ser Met Ser Glu 
85 90 95 

Ala lie Arg Thr lie Ala Tyr Lys Val Arg Thr Ala Ser Ser Gly Gly 
100 105 110 

Thr Ala Ser Val Asn Ser Phe Gly Asp Glu Gin Leu Ala Val Asp Leu 
115 120 125 

Leu Ala Asp Lys Leu Leu Phe Glu Ala Leu Lys Tyr Ser Gly Cys Cys 
130 135 140 

Lys Leu Ala Cys Ser Glu Glu Val Pro Glu Pro Leu Asp Leu Gly Gly 
145 150 155 160 

Glu Gly Phe Ser Val Ala Phe Asp Pro Leu Asp Gly Ser Ser lie Val 
165 170 175 

Asp Thr Asn Phe Ser Val Gly Thr lie Phe Gly Val Trp Pro Gly Asp 
180 185 190 

Lys Leu Thr Gly lie Thr Gly Arg Gin Gin Ala Ala Ala Gly Met Gly 
195 200 205 

lie Tyr Gly Pro Arg Thr Val Phe Cys lie Ala Leu Lys Asp Ala Pro 
210 215 220 

Gly Cys His Glu Phe Gin Asp Asp Gly Lys Trp Leu His Val Lys Glu 
225 230 235 240 

Thr Glu Thr lie Gly Glu Gly Lys Met Phe Ser Pro Gly Asn Leu Arg 
245 250 255 

Ala Thr Phe Asp Asn Pro Ala Tyr Glu Lys Leu lie Ala Tyr Tyr lie 
260 265 270 

Gly Glu Lys Tyr Thr Leu Arg Tyr Thr Gly Gly Met Val Pro Asp Val 
275 280 285 

Phe Gin lie He Val Lys Glu Lys Gly Val Phe Thr Asn Val He Ser 
290 295 300 

Pro Ser Thr Lys Ala Lys Leu Arg Leu Leu Phe Glu Val Ala Pro Leu 
305 310 315 320 

Ala Leu Leu Val Glu Lys Ala Gly Gly Ala Ser Ser Cys Asp Gly Leu 
325 330 " 335 

Cys Val Ser Gly Leu Asp Val Glu Val Lys Gin His Asp Gin Arg Thr 
340 345 350 

Gin He Cys Tyr Gly Ser Lys Gly Glu Val Arg Arg Phe Glu Glu Tyr 
355 360 365 
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Met Tyr Gly Asn Ser Pro Arg Phe Ser Glu Val Thr Ala 
370 375 380 
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